Networks of Photos, Landmarks, and People

نویسندگان

  • David Crandall
  • Noah Snavely
چکیده

Social photo-sharing sites like Flickr contain vast amounts of latent information about the world and human behavior. We describe our recent work in building automatic algorithms that analyze large collections of imagery in order to extract some of this information. At a global scale, we show how geo-tagged photographs can be used to identify the most photographed places on Earth, as well as to infer the names and visual representations of these places. At a local scale, we show that we can build detailed 3-d models of a scene by combining information from thousands of 2-d photographs taken by different people and from different vantage points. The dramatic growth of social content sharing websites has created immense collections of user-generated visual data online. Flickr.com alone currently hosts over 4 billion images taken by more than 40 million unique users [1], while Facebook.com grows by nearly 3 billion photos every month [2]. While users of these sites are primarily motivated by a desire to share photos with family and friends, collectively they are generating vast repositories of online information about the world and its people. Each of their photos is a visual observation of what a small part of the world looked like at a particular point in time and space. It is also a record of where a particular person (the photographer) was at a moment in time and what he or she was paying attention to. In aggregate, and in combination with the non-visual metadata available on photo sharing sites (including photo timestamps, geo-tags, captions, user profiles, and social contacts), these billions of photos present a rich source of information about the state of the world and the behavior of its people. In recent work, we have shown how vast photo collections like Flickr can be used to reconstruct information about the world at both global and local scales [3,4]. At a global level, we can create annotated maps of the world completely automatically, using the geo-tags on photos to reconstruct land boundaries, using tags to infer place names, and using visual analysis to find frequentlyphotographed scenes (an example is shown in Figure 1). We can also use this analysis to generate statistics about places, such as ranking landmarks by their popularity or studying which kinds of users visit which sites. At a more local level, we can use techniques from computer vision to automatically produce strikingly accurate 3-D models of a landmark, given a large number of 2-D photos taken by many different users from many different vantage points (see Figure 2). This work is part of a larger emerging research trend within computer science that is studying how to use publicly available data from online social networking sites to address questions in a range of fields in the humanities and social sciences [5]. Compared to traditional techniques like surveys and direct measurement, data collection from online social networking sources is of negligible cost and can be conducted at unprecedented scales. The challenge is that online data is largely unstructured, thus requiring sophisticated algorithms that can organize and extract meaning from noisy data. In our case, this involves developing automated techniques that can find patterns across millions of images. In this paper, we describe our recent work in using online photo collections to reconstruct the world at both global and local scales. Fig. 1. An annotated map of North America, automatically generated by analyzing nearly 35 million photos from Flickr. For each of the top 30 most photographed cities, the map shows the name of the city inferred from tags, the name of the most photographed landmark, and a representative photo of the landmark. (© David Crandall)

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Flickr Tags and Groups for Finding Landmark Photos

Many people take pictures of different city landmarks and post them to photo-sharing systems like Flickr. They also add tags and place photos in Flickr groups, created around particular themes. Using tags, other people can search for representative landmark images of places of interest. Searching for landmarks using tags results into many non-landmark photos and provides poor landmark summary f...

متن کامل

Developing a Recommendation Framework for Tourist by Mining Geo-tag Photos (Case Study Tehran District 6)

With the increasing popularity of sharing media on social networks and facilitating access to location technologies, such as Global Positioning System (GPS), people are more interested to share their own photos and videos. The world wide web users are no longer the sole consumer but they are producers of information also, hence a wealth of information are available on web 2.0 applications. The ...

متن کامل

Trip Pattern Mining Using Large Scale Geo-tagged Photos

Photo-sharing websites allows people to display their experiences on the Web through rich media data such as photos and videos. These photos contain spatial context in terms of latitude and longitude where the photo was taken. The geotagged photos disclose much information about people travel behavior and tourist density. As web-based and mobile-based technologies advance, geo-tagged photos are...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Feature Matching for Aligning Historical and Modern Images

Provision of historical information based on geographical location represents a new scope of connecting the present of a certain location or landmark with its history through a timescape panorama. This may be achieved by exploring a linear timeline of photos for certain areas and landmarks that have both historic and modern photos. Matching modern to historical images requires a special effort ...

متن کامل

What Your Images Reveal: Exploiting Visual Contents for Point-of-Interest Recommendation

The rapid growth of Location-based Social Networks (LBSNs) provides a vast amount of check-in data, which facilitates the study of point-of-interest (POI) recommendation. The majority of the existing POI recommendation methods focus on four aspects, i.e., temporal patterns, geographical influence, social correlations and textual content indications. For example, user’s visits to locations have ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011